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We reformulate the information causality in a more general framework by adopting the results of 
signal propagation and computation in a noisy circuit. In our framework, the information causality 
leads to a broad class of Tsirelson inequalities. This fact allows us to subject information causal- 
ity to experimental scrutiny. A no-go theorem for reliable nonlocal computation is also derived. 
Information causality prevents any physical circuit from performing reliable computations. 



I. INTRODUCTION 

As a physical theory, quantum mechanics has been ex- 
tremely successful in describing the microscopic physics. 
Nevertheless, its current framework is incapable of ex- 
plaining the nature of quantum entanglement. Attempts 
to remedy this situation have been made by reconstruct- 
ing quantum mechanics in terms of physical principles. 
These physical principles should be able to yield or con- 
strain the non-local correlation implied by the quantum 
entanglement. One such candidate is the principle of 
space-time causality. This principle will constrain the 
possible non-local correlation such that any physical the- 
ory must be no-signaling [1], i.e., signal cannot be send 
in the way of violating causality. However, a broad class 
of no-signaling theories other than quantum mechanics 
exist. Certain features, usually thought of as specifically 
quantum, are common for many of these theories [2, 3]. 
Clearly, no-signaling is insufficient as a principle to single 
out quantum mechanics. 

Some of these theories are allowed to have more non- 
local correlation than quantum mechanics [2-6]. Specif- 
ically, the non-local correlation in these theories can 
violate Bell-type inequalities by more than Tsirelson's 
bound [1, 7]. From this perspective, we should search for 
a physical principle as follows. The principle can single 
out Tsirelson's bound as a limitation on the extent of the 
allowed correlation for a physical theory. With the ad- 
vent of quantum information science, some principles of 
information theoretic flavor have been proposed. These 
proposed candidates set the constraints on the physi- 
cally realizable correlations. In this Letter, we focus on a 
promising candidate — the information causality. Infor- 
mation causality states that, in a bipartite code protocol 
prepared with any physically local or non-local resources, 
the accessible information gain cannot exceed the amount 
of classical communication. In [8] information causality 
is demonstrated by a generic task similar to random ac- 
cess codes (RAC) and oblivious transfer. In this task, a 
database of k bits is prepared: a := (a ,ai,--- ,dk-\), 
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where each aj is a random variable, which is only known 
by the first party, Alice. A second, distant party, Bob, is 
given a random variable fee (0, • • • , k — 1) along with a 
bit a send by Alice. With the bit a and the pre-shared 
correlation with Alice, Bob's task is to optimally guess 
the bit at. Then, according to information causality the 
quantity I has an upper bound 

fe-i 

I = ^2l(oi;P\b = i)<l. (1) 

i=0 

Here I(ai;/3\b = i) is the Shannon mutual information 
between a, and Bob's guessing bit j3 under the condi- 
tion b = i. Classically, I can reach 1 once a — ai and 
I{ai]a,j) = Sij ( i.e., the Kronecker delta). 

To perform the RAC task, Alice and Bob can use (ear- 
lier prepared and distributed) correlations among either 
classical or quantum systems. These no-signaling cor- 
relation resources can be simulated by the no-signaling 
box (NS-box) . The NS-box correlates the inputs and out- 
puts of Alice and Bob in an imperfect way subjected to 
the probabilistic noise. The noise of NS-box is intrinsi- 
cally inherited from the underlying physical theory such 
as quantum mechanics. The quantity I in (1) is unavoid- 
ably affected by the intrinsic noise of NS-box. In this 
framework, the signal decay theorem in [9, 10] for a noisy 
circuit is exploited to yield a tight bound for I(af,;/3|fe) 
in terms of noise of NS-box. According to information 
causality, the tight bound should also obey the upper 
bound in (1). By expressing the tight bound in terms 
of correlation functions between Alice's and Bob's mea- 
surement outcomes, this then yields our main result - 
a broad class of multi-setting Tsirelson-type inequalities. 
As a result, we can then subject the physical principle of 
information causality to scrutiny by experimentally ver- 
ifying or falsifying the generalized Tsirelson's bounds. 

Without classical communication, the RAC can be re- 
garded as nonlocal computation. Therein, distant Alice 
and Bob compute a general Boolean function without 
knowing the other's input. Here, NS-box can be regarded 
as a noisy gate for non-local computation [11]. Noise of 
the gate is closely related to the reliability of non-local 
computation. The computational noise of the gate is re- 
lated to the intrinsic reliability of the physically realized 
NS-box. In this aspect, we can tackle a fundamental 
question on noisy computation with its nonlocal version. 
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As raised by von Neumann [12], this question is origi- 
nally stated as follows. Could physical circuits of finite 
size perform the reliable noisy non-local computation of 
any Boolean function? Based on constraint by the infor- 
mation causality for any physical circuit, we will see that 
the answer is negative in non-local computation. 

The paper is organized as follows. In the next section 
we derive the Tsirelson-type inequalities from the infor- 
mation causality by using the theorem of signal propa- 
gation. In the section III we discuss the implication of 
information causality on the nonlocal quantum compu- 
tation and yield a no-go theorem for reliable nonlocal 
quantum computation. Finally we briefly conclude our 
paper in section IV. Moreover, in the Appendix we give 
the details of verifying our newly-derived Tsirelson-type 
inequalities by using the method of semidefinite program- 
ing. 



as follows 



II. TSIRELSON-TYPE INEQUALITIES 



We start by reformulating the NS-box as a noisy dis- 
tributed gate for nonlocal computation. The NS-box is 
initially distributed between two distant parties, Alice 
and Bob. Locally, Alice and Bob input bit strings x 
and y, respectively, into half of the box, which then out- 
puts bits A3 and Bg, respectively. The lengths of the 
bit strings can be chosen by design. Our NS-box is fur- 
ther characterized by the conditional joint probabilities 
Pr [A3 + Bg = f(x, y) \x,y\. Therein, f(x, y) is the task 
function. Notably, if the NS-box is physically realizable, 
these joint probabilities must fulfill the no-signaling con- 
ditions. 

In the RAC protocol, the chosen task function f(x,y) 
depends on how we encode Alice's database a and Bob's 
given random variable b into x and y, respectively. From 
now on, we will implicitly use the following protocol. 
Firstly, Alice encodes her database a into the (k — l)-bit 
string x := (xi, • • • , x/c_i) by Xj = a + a- L . Alice's half 
of NS-box then produces an outcome A3. At the same 
time, Bob encodes the given b in to (k — l)-bit string 
V ■= (Vi, ■■■ , Vk-i) by y l = 6 bti for 6^0, and y = for 
6 = 0. Bob's half of NS-box then produces an outcome 
Bg. Secondly, Alice sends Bob a bit a = a + A3. The 
optimal strategy for Bob's task is to output a guess bit 
j3 = a + Bg. As a result, Bob can decode Alice's bit 
a b successfully whenever A3 + Bg = x ■ y (modulo 2) is 
true. Most of the calculations in this Letter are modulo-2 
defined. 

In quantum mechanics, Alice's and Bob's outcomes can 
be produced by performing the corresponding measure- 
ment of 2 fe_1 and k settings, respectively. For the above 
protocol, the success probability of Bob's task in guessing 
Alice's bit a b is related to the one for noisy computation 



Pr[/3 = a b \b } = — ]T Pr [A s + Bg = f(x, y)\x,y] , 

x m 

(2) 

where N3 is the cardinality of the input space spanned by 
the encoding {x}. By defining the correlation functions 
between Alice's and Bob's measurement outcomes as 
C S ,ff := E^E^it-l^Pr^Bgl^l, 
we find 



x {S} 



(3) 



where the coding noise parameter is defined as £g := 
2 Pr [/3 = a b \b ] — 1. The sub-index y of £^ is understood 
to be equivalent to Bob's given parameter b via encoding. 

One of the main results of this paper is a broad class of 
Tsirelson's bound implied by information causality, i.e., 
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^Cg,ft\<Vk. 



(4) 



For k = 2, it is easy to check that (4) is the Tsirelson's 
bound |C ,o + C ,i + Ci, - C M | < 2^2 [7]. For the 
case of k > 2 with f(x,y) = x ■ y, we have verified (4) to 
be the Tsirelson's bound in quantum mechanics by using 
the semidefinite programing [14]. Please see Appendix 
for more detailed discussions. 

Indeed, later we will see that information causality will 
render (4). This implies that information causality can be 
tested by experimental verification or refutation via the 
measurement of the correlation functions of a quantum 
system. 

In order to arrive the Tsirelson's bound (4) from the 
information causality constraint (1), we need to relate 
I(a b ,f3\b) to It turns out that this can be done by 
using the following signal decay theorem on the signal 
propagation [9, 10]. 

Theorem 1: Let X, Y and Z be Boolean random vari- 
ables. Consider a cascade of two communication chan- 
nels: X Y Z. X and Y are the input and the 
output of the first channel, respectively. Let Y in turn 
be the input of a cascading binary symmetric channel C e 
with a noise parameter e, i.e., 



1(1 

2 (l-e) |(l + e) 



Let Z be the output of C e , (i.e., Z 
flipping probability \{l — e)) 

I(X;Z) 
I(X;Y)~ ■ 



Y with the bit- 



(5) 



A special case arises if the first channel is noiseless or 
trivial, i.e., I(X; Y = X) = 1 such that I(X;Z) < e 2 . 
Note also that regardless of the properties of the second 
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channel, there is a data processing inequality I(X; Z) < 
I(X;Y). 

We apply this theorem to our RAC protocol as fol- 
lows. Because Alice's database do, o,\, ■ ■ ■ , a^-i are ran- 
dom variables and independent of each other, so that 
all the Oj's with j ^ i can be fixed without disturbing 
I{a,i ;f3\b). Let X — a i: Y = a + f(x,y) , and Z = j3. 
Here Y is Bob's ideal answer and hence I(X;Y) = 1. 
The coding noise e for our protocol is ^j, then according 
to the Theorem 1, we have 



I(aap\b = i)<& 



(6) 



Therefore, the information causality in Eq. (1) yields 



(7) 



In [8], similar inequalities are derived to avoid the di- 
vergence of /, which justifies the information causality. 
However, such trouble does not exist in our reformula- 
tion because of the tight bound of Theorem 1. With 
the help of (3) the second inequality in (7) becomes a 
quadratic Tsirelson-type inequality for the correlation 
function Cg t g. Moreover, using the Cauchy-Schwarz in- 
equality, we can obtain | — V^j which results in 
the linear Tsirelson inequality of Eq. (4) . 



III. NOISY NONLOCAL COMPUTATION 

In the previous discussion we have considered the in- 
formation causality using a single nonlocal NS-box. In- 
stead, we can treat the NS-box as a non-local gate for 
performing the nonlocal computation, i.e., computing 
the function f(x,y) [11]. Unlike using the same gate 
for the RAC, no classical communication between Alice 
and Bob is required to perform the nonlocal computa- 
tion. In details, Alice's and Bob's local outputs are A$ 
and Bjj, respectively. The computation is successful if 
Ag + Btf = f(x,y). The computational noise parameter 
is defined as 



2¥t[A 3 + B$ = f{xj)\x,y\-l. 



(8) 



From (8) and (3) the computational noise of the gate is 
related to its coding noise by 



(9) 



{■?} 



Basically, computational errors inherently come from the 
gate noise. Information causality constraints the noisy 
extent of the NS-box as a gate. From this perspective, 
information causality is deeply connected with nonlocal 
computation. 

Furthermore, we can combine the NS-box gates to form 
a more complicated circuit without worrying about the 
coding protocol. Then the total task function for the 
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FIG. 1: RAC protocol for a (n, k, Z)-circuit. Each vertex of 
the circuit corresponds to a NS-box, with its details shown in 
the big ellipses. 



whole circuit will be a complicated function, i.e., a com- 
posite of task functions of all NS-boxes. We can then 
try to answer the following fundamental question: could 
a noiseless (nonlocal) computation be simulated using a 
noisy nonlocal physical resource ? 

Specifically, we consider the so-called (n, k, Z)-circuit, 
G, formed by cascading layers of noisy gates into a circuit 
in the form of a directed, acyclic tree (see Fig 1). On the 
top of G, there are n inputs to the NS-boxes — the leaves; 
at the bottom there is only one NS-box — the root. The 
longest path from the leaves to the root is called the 
depth of the circuit, denoted by I. The maximum input 
number of a gate in G is k. Note that, in [8] G comprises 
k = 2 gates and is exploited to compress n bits of x into 
one bit A$. However, there is no restriction on the task 
function for each NS-box, as long as the final circuit is a 
consistent acyclic tree diagram. 

We then use the circuit G to perform the follow- 
ing nonlocal computation. Alice's n-bit database a := 
(do, ffli, • • • , a n — i) is given to the leaves of G, and a con- 
ditional input b € {0, 1, • • • , n — 1} is given to the distant 
Bob. The previous encoding a — > "a? and b — > y for 
the RAC protocol, is also exploited here. Alice's out- 
put is properly encoded and then fed into the NS-box at 
the next layer, again with Bob's conditional input. The 
same procedure is performed recursively until reaching 
the root, with its output as the answer to the total task 
function at the root. 

Alternatively, Bob's decoding gates can be thought to 
be noise free, and the computational noise is only due 
to Alice's encoding gates, and vice versa. This makes it 
easier to understand the above procedure of noisy com- 
putation. Now we can consider the information flow of 
G. 

Theorem 2: For a noisy, local circuit G with an ar- 
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bitrary depth, the root outputs at most one-bit informa- 
tion. 

Note that the circuit G can perform the RAC if the 
appropriate protocol is given at each layer and 1-bit com- 
munication is allowed for the whole process. Then, the 
above theorem implies that information causality holds 
true for the circuit G. 

To prove the theorem, we will show that the mu- 
tual information between the leaves and the root of G 
is bounded by one. This can be done by mathemat- 
ical induction as follows. We begin with a circuit of 
depth one, which is nothing but a single NS-box; infor- 
mation causality ensures the bound. We then assume 
that the bound holds true for a circuit of depth I. Ac- 
cording to information causality and sub-additivity, the 
mutual information ii between the leaves and the root 
obeys /, (m) < J2 lm I(X im ,R m ) = Y. %m I{X im ; -R m |Bob's 
knowledge) < 1, where the index m labels a collection of 
circuits of depth I with root R m , and the index i m labels 
the inputs of the m-th circuit. Now, we construct a cir- 
cuit of depth £+1 by connecting all roots i? m 's to a single 
NS-box whose output is R. Then, the mutual information 
Ie+i between leaves and root R of the final circuit should 
obey the subadditivity, i.e., Ig + i < £ m £ im ^P^imi-^)- 
From Theorem 1, we have I(X im ;R) < (f n I(X lm ;R m ) 
because we have a cascade of two channels: X im 
R m <—t R where the second channel is a binary sym- 
metric one with the noise £ m . Using this result, we have 
h+i < £ ro &£i m I{X im -R m ) < < 1. Q.E.D. 

Here, we have only considered the case in which the 
computational noise is isotropic to x, denoted by eg. 
From (9) we have eg — £g and the information causality 
requires £{^j e| < 1- We would like to know whether the 
reliable computation is also constrained by the informa- 
tion causality or not. To check this, we invoke the main 
Evans-Schulman theorem on the conditions for reliable 
noisy computation as follows [9, 10]. 

Evans-Schulman Theorem: A circuit of complete fc-ary 
tree with depth I ( i.e., n = k l ) can perform 8- reliable 
noisy computation only 

• (i) if E{y } 4 > 1 then I > log(nA)/ log(£ {y - } el) , 

• (ii) if E { y } 4 ^ 1 thcn n ^ V A > 

where A := 1 + S log S + (1 — 5)log(l — 6). The com- 
putation is called <5-reliable if the root outputs correctly 
with a probability 1 — 5 (with 5 < 1/2). This theorem 
provides stricter conditions than the original proposal by 
Von Neumann [12, 13]. 

By definition, smaller eg means larger noise, and the 
condition (ii) is for the cases with larger noise such that 
only functions with a smaller number of inputs can be 
reliably computed. Immediately, we see that informa- 
tion causality implies a large computational noise for the 
RAC circuit such that only condition (ii) for reliable noisy 
computation can possibly be fulfilled. As a result, Alice's 
output asymptotically becomes random because A — > 



and hence <5 — > \ as n — > oo. In summary, this implies 
that information causality prevents any physically realiz- 
able (n, k, I) -circuit from achieving reliable computations 
of excessively complicated functions, i.e., with either too 
many inputs or lengthy steps needed. 

The above result applies only when classical communi- 
cation between Alice and Bob is disallowed. Under such 
circumstances, the noise of the gate is intrinsically con- 
strained by the underlying physical theory. Otherwise, 
the classical communication can be exploited to improve 
the reliability of the gates so that the no-go result could 
be lifted. 



IV. CONCLUSION 

We show how information causality leads to Tsirelson 
bounds in a much easier way. A series of new Tsirelson 
bounds are then derived. These bounds provide some 
playground to test the information causality by experi- 
ments, as done before to test the Bell inequality. More- 
over, deep ramifications concerning non-local quantum 
computation are also found and discussed. Especially, 
the no-go theorem for the reliable nonlocal quantum com- 
putation deserves more study to clarify its physical im- 
plication. 

The authors acknowledge financial support from the 
NSC of Taiwan under Contract No. NSC.99-2112-M- 
033-007-MY3 and 97-2112-M-003-003-MY3. This work 
is partially supported by NCTS. 

Appendix 

In this appendix, we write down the detail of getting 
the Tsirelson- type inequalities derived from IC, and also 
check these inequalities directly by semidefinite program- 
ing (SDP). 

We review the the RAC protocol as follows. Alice has 
a database of k bits ao, a\, , , ak-i where s {0,1} is 
the random variable Vi € (0, • • • , k — 1). The distant 
Bob is given a random variable b e (0, ...k — 1) and a bit 
a sent by Alice. Bob's task is to guess ab- Here we will 
consider the RAC protocol with different settings. Case 
(a) is proposed in the main text. In case (b), Alice's and 
Bob's settings are modified. In the following, Alice's 
input is denoted by an A-bit string x — x\ . . . Xn- Let 

N 

x = 1 + ^2 l - 1 x l , 1 < x < 2 N . Bob's input is denoted 

j=i 

by N- bit string y = y x ...y N . 
• Case (a) 

Here N = k— 1, and Xi = a + ai \/i e {1, k — 1}. 
Vi=Sijb VzG {l,...,fc-l}, if b^O. y = Q if 6 = 0. 

N 

Let y = 1 + ^«J/i, 1 < y < k. In this case, the 
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Tsirelson-type inequality derived from information 
causality following the procedure in the main text 
is 

I J2 (-l)^,yl < 2^ Vfc. (10) 
{S},{y} 



• Case (b) 

Here N = k, xi — aj_i, and y.^ = Si^+i Vi G 

N 

{1, ...,fc}. Let y — Y^ij/i, 1 < y < k. Then, the 

i=l 

Tsirelson-type inequality from information causal- 
ity is 

I £ (-lf«C s , ff \<2 k Vk. (11) 
{3}-,{y} 

We now briefly introduce the semidefinite program- 
ming [14]. SDP is the problem of optimizing a linear 
function subjected to certain conditions associated with 
a positive semidefinite matrix X, i.e., v^Xv > 0, for 
v G C™, and is denoted by X y 0. It can be formulated 
as the standard primal problem as follows. Given the 
n x n symmetric matrices C and D^s with q = 1, • • • , m, 
we like to optimize the nxn positive semidefinite matrix 
X y such that we can achieve the following: 

minimize Trace(C T X) (12a) 

subject to Trace(D q X) = b q , q = 1, • • • , m . 

(12b) 

Corresponding to the above primal problem, we can ob- 
tain a dual problem via a Lagrange approach [17]. The 



Lagrange duality can be understood as the following. If 
the primal problem is 

minimize fo(%) (13a) 
S-t. f q (x) < 0, q G 1...TO. (13b) 
/i ? (z)=0, gel...p, (13c) 

the Lagrange function can be defined as 

L(x, A, v) = f (x) + ^ =1 \ q f q {x) + Y? q=l v q h q {x), (14) 

where Ai,. . . , A m , and f lv . . ,z/ p are Lagrange multipliers 
respectively. Due to the problem and (14), the minima of 
/o is bounded by (14) under the constraints when Ai,. . . , 
A m >0. 

inf fo > inf L(x, A, v) . 

X X 

Then the Lagrange dual function is obtained. 

g(X, v) = inf L(x, A, v). 

X 

g(\, v) < p (p is the optimal solution of fo(x) ), for Ai,. . . , 
A m > and arbitrary u\,. . . ,v p . The dual problem is 
defined. 

maximize g(X, v) (15a) 
s.t. X q >0. (ge{l...m» (15b) 

We can use the same method to define the dual problem 
for SDP. From the primal problem of SDP (12), we can 
write down the dual function by using minimax inequal- 
ity [18]. 



inf Trace{C T X) = inf Trace(C T X) + ^y q (b q - Trace(DjX)) 

X -° X -° 9 =1 

m m 

= j£„ SU P E VM + Trace((C T £ y q D T q )X) 

y q=l q=l 
m m 

> sup inf VM + Trace{(C T - £ Vq D T q )X) 

V - q=l 9=1 

m m 

= SU P 1^ E + 7>a Ce ((C - £ y q D q ) T X). (16) 

V - q=l 9 =1 

The optimal solution of dual function is bounded under some vector y [20] . 

m mm m 

sup .inf £ y q (b q ) + Trace((C -£ y q D q ) T X) = sup £j/,(6 g ) when C-^!/,O g ^0 

y ~ 9=1 9=1 a 9=1 9=1 

— oo otherwise. (17) 
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The corresponding dual problem is 

m 

maximize ^^Uqi^q) (18a) 
9=1 

m 

s.t. S = C -^2y q D q hO. (18b) 

9=1 

If the feasible solutions for the primal problem and the 
dual problem attain their minimal and maximal values 
denoted as p' and d! respectively, then p' > d' , which 
is called the duality gap. This implies that the optimal 
solution of primal problem is bounded by dual problem. 
This then leads to the following: Both the primal and 
the dual problems attain their optimal solutions when 
the duality gap vanishes, i.e., d' = p'. 

We now use SDP to check the Tsirelson-type bound. 
To cast the above problem of finding the Tsirelson's 
bound in the context of quantum mechanics, we need to 
use Tsirelson's theorem [15]. It says that for any quan- 
tum state 1^) € A B shared by two observers Alice and 
Bob with their measurement outcomes being A x € [—1,1] 
and By e [—1,1], respectively. The correlation function 
can be expressed by the inner product of two real unit 
vectors a x , P y £ R t+V . Therein, t and v are the numbers 
of Alice's and Bob's measurement settings, respectively. 
In detail, Cs,y used in (10) or (11), the Tsirelson's the- 
orem guarantees that we have C^j = a x ■ P y . Then, we 
can cast the problem of finding the Tsirelson bound in 
(10) or (11) into the following form of optimal problem 
for SDP, 

maximize j ^ (— l) x '^a x ■ (3 y | (19a) 
{S}-,{y} 

s.t. 11^11=11^11=1, Vi,y. (19b) 
Then, the associated dual problem is 



We now will turn the problem (19) into the primal prob- 
lem (12) by constructing the matrices X, C and A^s from 
the unit vectors ct x and P y . Following the way in [16], 
the mapping is as follows. Define the matrix P whose 
columns are vectors (ai, at, Pi, ■ ■■P v )- Then the SDSP 
matrix X is given by P T P, which can be put into the fol- 
lowing block form 



x - (g h) 

where the matrix elements of each block are Eij — en ■ aj , 
Fib = ai ■ (3b , G a] = Pa ■ aj and H ab = P a • fib with 
i,j = l,--- ,t (t = 2 N ) and a, b = l,--- ,v (v = k). 
Note that F and G are used in (19), and instead E and 
H are used in (19b). Therefore, we can write down the 
matrices C and DqS accordingly so that the problem (19) 
is equivalent to the problem (12). It is easy to see that 
C is a matrix with only non- vanishing off-diagonal block 
of matrix elements given by (— l) x ' y , and DqS are the 
diagonal matrices with {D q ) st = 6 Stq 5t iq . We omit their 
detailed forms here. 

We take k = 2 and k = 3 in case(a) for example. 
• k=2 

Here x = x\ and y — y\. According Eq. (10), wc 
want to maximize |Co,o + Co,i +Ci,o — Using 
the Tsirelson theorem, it is equivlaent to maximiz- 
ing ai ■ Pi + ai ■ p2 + Q!2 • Pi — ct2 • @2 ■ Such Tsirelson 
bound has been showed by Wehner [16] using SDP. 
We just show the numerical result. For more de- 
tails, please see [16]. After using SeDuMi program 
[19] to solve SDP, the optimal for both primal and 
dual problem is 2.8284. It is consistent with the 
Tsirelson bound [7] (2y/2) for the case two settings 
per site. 



minimize 



X> (20a) 



9=1 



s.t. S = VqD q - C h 0. (20b) 

9=1 



• k=3 Here x = xix 2 and y = 2/12/2- Notably, y € 
{00, 10, 01}. The problem which we want to solve 
is 



maximize |Coo,oo + Coo, 10 + Coo, 01 + Coi,oo + Cnao — Coi,oi 
+Cio,oo — Cio,io + Cio,oi + Cn,oo — Cn,io — Ci 1,01 1 
= maximize ai ■ Pi + ai ■ P2 + ai ■ P3 + a% ■ Pi + a% ■ P2 — ce^ ■ P3 

+a 2 ■ Pi - a 2 ■ P 2 + a 2 ■ P 3 + a 4 • Pi - a 4 • P 2 ~ «4 • Pi- (21) 

I 



The X matrix for primal problem is X = S T S 



where the columns of S correspond the unit vec- 



tors (ai,a2,a3,a 4 ,/3i,#2,/33)- 



X 



I OL\ ■ OL\ OL\ ■ U 2 OL\ • OL 3 Qfi • Qf4 OL\ ■ (3i U\ ■ p2 Ctl 
a 2 ■ tti «2 • «2 «2 1 Q!3 a 2 ' "4 "2 ' fll Oi 2 ' /?2 «2 
Qf3 ■ ai 0:3 • «2 «3 1 C*3 <^3 • «4 0:3 ■ /3i 03 • /?2 



/3 3 \ 

ft 

a 3 ■ P 3 

Qf4 • ai 0:4 • «2 «4 • Oe 3 014, • OL4 014, ■ /3i 0.4 - 02 OL4 ■ /?3 

(3i ■ on Pi- a 2 Pi • «3 Pi ■ a 4 Pi - Pi Pi- Pi Pi 
Pi ■ ai p 2 - a 2 P 2 -a 3 p 2 ■ a 4 p 2 - Pi p 2 ■ p 2 p 2 
\ ■ 0:1 Pz- a 2 P 3 



a 3 p 3 -a 4 p 3 -Pi p 3 -p 2 P 3 -P 3 / 

r 



According to (21), the matrix C is denned 














1 


1 


1 


\ 














1 


-1 


1 














1 


1 


-1 
















1 


-1 


-1 




1 


1 


1 


1 













1 


-1 


1 


-1 













\1 


1 


-1 


-1 











/ 



The norm of the vectors (ai, a 2 , a 3 , a 4 , Pi, P2, P 3 ) 



must be one is the source of the constrain. Each 
of the matrix D q (q = 1...7) is a 7 x 7 diagonal 
matrix with the q-th diagonal element being one 
and zero others. The value b q (q = 1...7) is one. 
The numerical result shows that the tight bound is 
6.9282, which essentially agrees with (10). When 
we get the optimal solution, the correlation func- 
tion matrix is 



/ 1.0000 
0.3333 
0.3333 
-0.3333 
0.5774 
0.5774 
0.5774 



0.3333 
1.0000 

-0.3333 
0.3333 
0.5774 

-0.5774 
0.5774 



0.3333 
-0.3333 
1.0000 
0.3333 
0.5774 
0.5774 
-0.5774 



-0.3333 
0.3333 
0.3333 
1.0000 
0.5774 
-0.5774 
-0.5774 



0.5774 
0.5774 
0.5774 
0.5774 
1.0000 
0.0000 
0.0000 



0.5774 
-0.5774 

0.5774 
-0.5774 

0.0000 

1.0000 
-0.0000 



0.5774 \ 

0.5774 
-0.5774 
-0.5774 

0.0000 
-0.0000 

1.0000 



X satisfying the constraint that X is SDSP with 
non-negative eigenvalues [20]. 

For the case k = 3 to k = 8 

After setting up the SDP for finding the Tsirelson 
bound, we still use the package named SeDuMi to 
solve it for both case (a) and (b) with any value 
of k. The result agrees extremely well with the 
bound obtained from information causality up to 
O(10~ 4 ). To be more concrete, the numerical re- 
sults are shown below: for case (a) up to k = 8, we 
have 



k 


3 


4 


5 


6 


7 


8 


SDP 


6.9282 


16.0000 


35.7771 


78.3837 


169.3281 


362.0387 



This agrees extremely well with the RHS of (10). 
Similarly, for case (b) up to k = 8, we have 



k 


3 


4 


5 


6 


7 


8 


SDP 


13.8564 


32.0000 


71.5542 


156.7673 


338.6562 


724.0773 



It again agrees extremely well with (11). Therefore, 
based on our numerical simulation, information causal- 
ity indeed singles out the Tsirelson bound of a physical 
theory such as quantum mechanics. 
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